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What is claimed is: 

1. .A method for feature selection based on 
hierarchical local -region analysis of feature 
characteristics in a data set, comprising: 

5 partitioning a data space associated with a data set 

into a hierarchy of pluralities of local regions; 

using a similarity metric to evaluate for each local 
region a relationship measure between input features and a 
selected output feature; and 
10 identifying one or more relevant features, by using 

the relationship measure for each local region. 

2. The method of claim 1 further comprising: 
determining a feature relevancy of a selected feature 

15 by performing a weighted sum of the relationship measures 
for the selected feature over the plurality of local 
regions. 

3. The method of claim 2, wherein weights for the 
20 weighted sum are based on sizes of the respective local 

regions . 

4. The method of claim 1, wherein the partitioning of 
the data space into the hierarchy of pluralities of local 

25 regions is performed by hierarchical clustering of the data 
set in a plurality of levels. 
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5. The method of claim 4, wherein feature relevancies 
are determined for each of the input features based on the 
relationship measures at each level of the hierarchical 

5 clustering and the relevant features are identified based 
on the feature relevancies. 

6. The method of claim 1 further comprising: 
determining for each local region a corresponding 

10 subset of relevant features based on the relationship 
measures for the local region. 

7. The method of claim 6, wherein the subsets of 
relevant features for respective local regions are non- 
15 identical. 

8. The method of claim 1, wherein the local regions 
are nonoverlapping . 

2 0 9. The method of claim 1, wherein the similarity 

metric is linear. 

10. The method of claim 1, wherein the similarity 
metric includes a projection or distance. 
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11. The method of claim 1, wherein the relationship 
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measure includes a correlation. 

12. The method of claim 1, wherein the relationship 
measure includes R 2 . 

5 

13. A computer system, comprising: 
a processor; and 

a program storage device readable by the computer 
system, tangibly embodying a program of instructions 
10 executable by the processor to perform the method claimed 
in claim 1. 

14. A program storage device readable by a machine, 
tangibly embodying a program of instructions executable by 

15 the machine to perform the method claimed in claim 1. 

15. A computer data signal transmitted in one or more 
segments in a transmission medium which embodies 
instructions executable by a computer to perform the method 

20 claimed in claim 1. 

16. A method for feature selection based on 
hierarchical local -region analysis of feature 
characteristics in a data set, comprising: 

2 5 partitioning a data space corresponding to a data set 

into a hierarchy of pluralities of local regions; 
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on each level of the hierarchy, using a similarity 
metric to evaluate for each local region in the level a 
relationship measure between input feature values on the 
one hand and a selected output on the other hand; and 
5 determining a relevancy of a selected feature by 

performing a weighted sum of the relationship measures for 
the feature " over the plurality of- local regions at 
appropriate levels . 

10 17. The method of claim 16, wherein the partitioning 

of the data space is performed through hierarchical 
clustering of the data set in a plurality of cluster 
levels . 

15 18. The method of claim 17 further comprising: 

identifying relevant features at each level of the 
hierarchical clustering and determining corresponding 
feature relevancies. 

20 19. The method of claim 16, wherein weights for the 

weighted sum are based on sizes of the respective local 
regions . 

20. The method of claim 16 further comprising: 
2 5 ranking the input features according to the 

corresponding feature relevancies of the input features. 
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21. The method of claim 16, wherein the local regions 
are nonover lapping . 

5 22. The method of claim 16, wherein the. similarity 

metric is linear. 

23. The method of claim 16, wherein the similarity 
metric includes a projection or distance. 

10 ... . . 

24. The method of claim 16, wherein the relationship 
measure includes a correlation. 

25. The method of claim 16, wherein the relationship 
15 measure includes R 2 . 

26. A computer system, comprising: 
a processor; and 

a program storage device readable by the computer 
2 0 system, tangibly embodying a program of instructions 
executable by the processor to perform the method claimed 
in claim 16 . 

27. A program storage device readable by a machine, 
2 5 tangibly embodying a program of instructions executable by 

the machine to perform the method claimed in claim 16. 
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28. A computer data signal transmitted in one or more 
segments in a transmission medium which embodies 
instructions executable by a computer to perform the method 
claimed in claim 16. 



